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Whole Genome Sequence of the Probiotic Strain Lactobacillus 
paracasei Nil 15, Isolated from Traditional Chinese Fermented Milk 
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Lactobacillus paracasei N 1 1 1 5 is a new strain with probiotic properties isolated from traditional homemade dairy products in 
Inner Mongolia, China. Here, we report the complete genome sequence of L. paracasei Nl 1 15, which shows high similarity to 
the well-studied probiotic Lactobacillus rhamnosus GG, and 3 structures turned out to be inversions, according to the colinear- 
ity analysis of the BLAST alignment. 
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Lactobacillus paracasei Nl 1 15 has been isolated from traditional 
dairy products in Inner Mongolia, China. A patent has been 
applied for this strain to be a probiotic since it exhibits high-level 
resistance to acid and bile stresses and it stimulates macrophages 
to produce interleukin 10 (IL-10), IL-6, and tumor necrosis factor 
alpha (TNF-a) (1). The whole genome of L. paracasei Nil 15 was 
sequenced by the Illumina HiSeq 2000 platform (performed by 
the Beijing Genomics Institute [BGI], Shenzhen, China). 

High-molecular-weight genomic DNA was used to construct 
three genomic libraries, with 0.5-kb (914 Mb), 2-kb (250 Mb), and 
6-kb (250 Mb) insert sizes, respectively. Using the Illumina HiSeq 
2000 system, 1.4 Gbp clean data were given, which provided 420- 
fold genome coverage. Using SOAPdenovo (http://soap. genomics 
.org.cn/soapdenovo.html) (2), the reads were assembled into 38 
scaffolds and 268 contigs. Using the SOAPaligner (http://soap 
.genomics.org.cn/soapaligner.html) tool, 96.2% of the reads were 
mapped to assembly sequences. Most of the gaps within the scaf- 
folds were filled by local assembly and overlap extension with 
paired-end reads. We also filled the gaps between scaffolds by 
sequencing PCR products using an ABI 3730 capillary sequencer. 

The complete genome of Nil 15 contains a single circular chro- 
mosome of 2,938,059 bp (46.59% G+C) and four plasmids (pNl 
[3,833 bp], pN2 [55,121 bp], pN3 [58,511 bp], and pN4 
[8,755 bp] ). The overall G+C content is 46.45%, whereas the four 
plasmids have G+C contents of 43.26% on average. This genome 
size is slightly larger than the complete genomes of Lactobacillus 
rhamnosus ATCC 53103 (3.010 Mb) and Lactobacillus casei ATCC 
334 (2.925 Mb). The complete genome of Nil 15 contains 3,063 
predicted gene sequences with 2,584,095 bp (2-6), covering 
84.33% of the genome. Nl 1 15 also harbors 15 rRNA operons and 
61 tRNAs (7, 8). 

A comparative genome analysis was made between the assem- 
bly sequence and reference sequence, and 3 structural variations 
were found. Based on the colinearity analysis of the BLAST align- 
ment, 3 structural variations turned out to be inversions (an in- 
version is a chromosome rearrangement in which a segment of a 



chromosome is reversed end to end). All 3 inversions were verified 
by PCR amplification. 

A significant comparative genome analysis showed that Nl 1 15 
carries most of the core genes of L. casei, and no known pathogenic 
genes were identified. A comparative genomic analysis was per- 
formed with the published genomes of L. casei ATCC 334 and 
L. rhamnosus GG. In conclusion, the genome sequence provides 
new avenues to further explore the gene-based functional and 
probiotic mechanisms of Nil 15. In addition, comparative and 
functional genomics analyses might also be carried out to trace the 
origin and evolution of this bacterium. 

Nucleotide sequence accession numbers. The L. paracasei N 1 1 1 5 
chromosome and plasmid pNl, pN2, pN3, and pN4 sequences have 
been deposited in GenBank under accession no. CP007122, 
CP007123, CP007124, CP007125, and CP007126, respectively. 
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